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Bias correction for estimators of the extremal index 
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Abstract 



We investigate the joint asymptotic behavior of so-called blocks estimator of the extremal 
index, that determines the mean length of clusters of extremes, based on the exceedances over 
different thresholds. Due to the large bias of these estimators, the resulting estimates are 
usually very sensitive to the choice of the threshold and thus difficult to interpret. We propose 
and examine a bias correction that asymptotically removes the leading bias term while the rate 
\J-4 I of convergence of the random error is preserved. 







1 Introduction 



^ I When one analyzes a risk related to extreme values of a stationary time series, then the clustering 
^J • behavior of extremes can be as least as important as the tail behavior of the marginal distribution. 
Q^ . For example, while a flood control basin may cope with a single day of extreme rainfall, an extended 
^^ I period of heavy rain will more likely lead to a flooding of the surrounding area. Similarly, large 
C^ ' negative returns on a stock index over several days may sum up to an overall loss which is much 
worse than the most extreme crash ever experienced on a single day. 

Obviously, there is no single parameter which captures all facets of serial dependence between 
extreme values, and in different applications different features may be of interest. Recently, Drees 
and Rootzen (2010) introduced a very flexible class of empirical processes that are capable of 
describing quite general aspects of extremal dependence. In the present paper, it is demonstrated 
5^ , how the asymptotic theory of these empirical processes can be used to immensely improve the 
performance of well-known estimators of the so-called extremal index, that is the reciprocal value 
of the asymptotic mean cluster size. 

More specifically, let a stationary time series Xi,l < i < n, with marginal distribution function 
(d.f.) F be observed. We assume that F belongs to the maximum domain of attraction of some 
extreme value d.f. G^, i.e., for an accompanying sequence of independent and identically distributed 
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(i.i.d.) random variables (r.v.s) Xi, 1 < i < n, with d.f. F there exist normahzing constants a„ > 
and 6„ G M such that 

r maxi<.<„X.-fe„ ^^| _^ ^^^^ 

I an ) 

as n — )• oo. It is well known that (up to a scale and location parameter) G^ must be of the form 
G^{x) = exp ( — (1 + jx)~^''^) for all x such that 1 + 7X > 0. Let 

Unix) := anX + bn- 

Moreover, assume the following mild mixing condition (a weakened version of Leadbetter's condi- 
tion D): 

There exist coefficients an,i and a sequence /„ = o{n) such that an,«„ — )• as n — )• oo and 

IPJmaxXj > Un(x),maxXj > u„(j;)}— P{ maxXj > u„(x)}-P{ maxXj > u„(x)}| < a„_; 

for all X G M and all /i, /2 C {1, . . . , n} such that maxli < minl2 —I, 1 <l <n — 1. 
Then there exists a constant 6 G [0, 1], the so-called extremal index, such that 

^r maxi<,<„X,-6„ ^^| _^ ^, ^ ^^^ 

I an ) ^ 

provided that the left hand side converges (to an arbitrary limit) for some a; G M. In what follows, 
we will always rule out the degenerate case 6 = which, in the limit, corresponds to clusters of 
extremes with infinite mean length. 

If the extremal index 9 is strictly positive, then usually it may be interpreted as the reciprocal 
value of a limiting cluster size. To see this, note that from (jl.ip and (|1.2p one may conclude 

P{ma,Xi<i<:n Xi > Unix)} . w ^m 

1 - F"^(n„(x)) 

with the convention 0/0 := 1. Indeed, Hsing (1993, Theorem 3.1) proved that under a stronger 
mixing condition this convergence holds uniformly in x. If the following condition holds: 
There exist coefficients an,i and a sequence In = oin) such that an,i„ — )■ as n — )■ oo and 

P(maxXj > Unix) \ maxXj > n„(x)) —P| max Xj > n„(x)} < dn,i 

for all X G M and all /i, /2 C {1, . . . , n} such that max/i < min/2 — /, 1 < / < n — 1, 

then 

P{maxi<i<„ Xi > u} 



sup 



1 



0. (1.3) 



1 - F"^(m) 

Now a Taylor expansion yields 1 — F"^(ii) ~ nOFiu) = 0-E(C„(u)) uniformly for all u G [n„, -F^(l)) 

where C„(m) := X^ILi ^\X- > u\ denotes the total number of exceedances over u, provided n„, — )• 

F^(l) := sup{x G M | F(x) < 1} such that F(u„) := 1 - F(m„) = o(l/n). Hence, in view of (fOI) . 

it follows 

1 P{maxi<j<n Xi > u} 



EiCniu) I Cniu) > 0) nFiu) 



:i.4) 



uniformly for all u S [un,F'^{l)). 

Convergence ()1.4p suggests to estimate 9 by replacing the unknown probability and expectation 
on the left hand side by empirical counterparts. Since we cannot estimate P{maxi<j<„ Xi > u} 
consistently if we observe merely n consecutive r.v.s Xi, 1 < i < n, we must first replace n with 
Vn = o{n) in (ll.4p and adjust u accordingly. Thus we split the sample into rUn = \n/rn\ blocks of 
length Tn and estimate 9 by 

Q .^ ^i=i {max(j_i)^^<j<jv^ Xj > u„} 

l^j=l Z^i=(j-l)r„ + l ^{Xi > Un} 

for a sequence of thresholds m„ satisfying rnF{un) -^ 0, but nF{un) -^ oo. 

This so-called blocks estimator of the extremal index has been intensively studied in the literature. 
Hsing (1993) and Weissman and Novak (1998) proved its consistency and asymptotic normality 
under suitable mixing conditions. Variants of the blocks estimator were also examined by Smith 
and Weissman (1994) and Robert et al. (2009). As alternatives to blocks estimators, so-called runs 
estimators of 9 have been proposed. While, in the numerator of the right hand side of (jl.Sp . the 
number of clusters of extremes is defined as the number of blocks of length r„ which contain at 
least one exceedance, in the runs approach two exceedances are considered to belong to different 
clusters if they are separated by at least f„ consecutive observations that do not exceed u„: 

2^i=i {Xi > Un, Xj < n„ for alH + 1 < j < i + f„} 

^i=l ^{Xi > Un} 

The asymptotic behavior of this estimator was examined by Hsing (1993), Smith and Weissman 
(1994) and Weissman and Novak (1998), among others. Yet another approach was suggested by 
Ferro and Segers (2003), who used interarrival times between exceedances to estimate the extremal 
index. 

In all these papers, the behavior of the estimators was analyzed for a fixed sequence of thresholds. 
Below we will argue that the analysis of the joint behavior of blocks estimators for different 
thresholds does not only provide deeper insight, but that it is the key to a remarkable reduction 
of the bias. 

Indeed, all the estimators mentioned above are plagued by serious bias problems, which often 
renders inconclusive the analysis of the strength of extremal dependence. As a typical example, 
consider the following autoregressive time series of order 1 with Cauchy innovations et'- Xt = 
ipXt-i + £t with ip = 0.6. Figures [T] (a) and (b) display blocks and runs estimates of 9 based on 
the exceedances over F^{u) = X„_|-„„-|_,_x.„ as a function of u for several block lengths r„, resp. 
run lengths f„. (Here Fn denotes the empirical d.f. and Xi-n the ith smallest order statistic.) The 
true value 9 = 1 — if is indicated by the horizontal lines. The estimates are almost monotone 
functions in u and monotonically increasing in the block lengths Vn, resp. run lengths f„. (The 
latter monotonicity holds by construction if n is divisible by Vn resp. if the last f„ observations 
do not exceed the threshold.) Since there is no region where the estimates remain stable, it is not 
obvious how to choose the threshold appropriately. Without an objective procedure for choosing 
the threshold, it will thus be difficult to justify any particular estimate for the extremal index. 
In Section [3] we suggest a method to combine blocks estimators that are based on the exceedances 
over different thresholds in a suitable way such that the leading bias term cancels out for many 
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Figure 1: Blocks estimator (left) with block lengths r = 5 (blue solid line), r = 10 (red dashed) and 
r = 20 (black dash-dotted), runs estimator (middle) with run lengths f = 2 (blue solid), f = 5 (red 
dashed) and f = 10 (black dash-dotted), and bias corrected blocks estimator (right) as functions 
of the standardized threshold for a AR(l)-times series with (p = 0.6 and Cauchy innovations; the 
true extremal index equals 1 — (/? = 0.4. 

well-known time series models. In Figure [1] (c) the resulting estimates based on exceedances over 
F^{u) are shown (again as a function of u) for the same block lengths. Obviously, the estimates 
are not only almost constant for a wide range of thresholds, but they also vary much less with the 
block length than the original blocks estimator. 

The remainder of the paper is organized as follows. In Section 2, we establish a limit result 
for processes of blocks estimators indexed by the threshold. To this end, we represent the blocks 
estimators as functionals of a suitably defined empirical cluster process. Then the joint asymptotic 
behavior of the blocks estimators easily follows from a general limit theorem of such processes 
proved in Drees and Rootzen (2010). In the main Section [3] we first show that often the leading 
bias term of the blocks estimators is a power function of the threshold. We then introduce a 
method to remove the leading bias term of the blocks estimators in that case without deteriorating 
the rate of convergence of the random error part. All proofs are collected in Section [H 



2 Joint asymptotics of blocks estimators 

In this section we want to analyze the joint asymptotic behavior of blocks estimators over a whole 
continuum of thresholds. Since here we are interested in the extremal dependence (and not in the 
marginal tails), the results should be invariant under strictly increasing transformations of the 
observations. Hence it is natural to parameterize the thresholds in terms of the marginal quantile 
function F^ , that is to consider 



Errin -1 



^n,t 



max 



(j-l)''"n<«<j 



r^X,>F^{l-Vnt)] 



i=l Z^i=0-^l)r„+l ^{X, >F^{1- Vnt)} 



< t < 1. 



For later applications, though, it is more convenient to examine a version where the unknown 



quantile function is replaced with an empirical analog: 



7n,i 



0<t < 1. 



If there are no ties among the largest \nvn\ observations and none of them are among the last 
n — mnTn observations, then On t can be rewritten as 



^n,t 



.7=1 



In particular, this representation holds with probability tending to 1 if we assume that F is 
continuous on some neighborhood of F^(l) and TnVn — >■ 0, which we will do throughout the 
remainder of the paper. 

For sufficiently large n, we then have 



^i=i {max(j_i)^^<i<jv^ Ui>\- Vni] 



n,t \-^rr).r^ ^-^JVn 



j = l 2^i=(j-l)r„ + l ^{Ui > 1 - Vnt} 

where the random variables Ui = F{Xi), 1 < i < n, have a distribution which equals the uniform 
distribution in a neighborhood of 1. Thus this blocks estimator can be expressed in terms of certain 
empirical processes of cluster functionals that have been introduced and analyzed by Drees and 
Rootzen (2010). To this end, define standardized excesses 

._ {U,-{l-Vn))+ _ {U^-{l-Vn))'^0 . . , . „ 

Vn "^n 

blocks thereof 

and functionals on My := IJ^gp^M' by 

ft{xi,...,xi) := l{maxi<i<i Xj > 1 - t} 

gt{xi,...,xi) := Y.'^{xi>l-t]- 



i=l 



Then 



"'* mn^ Er=i 9t{Yn,j) E{gt{Yn,i)) + inVn)^/^mn^Zn{gt) ' 



(2.1) 



where for a generic functional h : My — ?■ M we define 



1 rn„ 
Zn{h) := —= Y, iKYnj) - Eh{Yn,j)). 



J=l 



Under suitable conditions on the time series and the family Ti of functionals h, Drees and Rootzen 
(2010) proved convergence of the empirical processes {Zn{h))hen to a centered Gaussian process 
with continuous sample paths. 



Here we recall conditions that ensure the convergence of the processes (Z„(/j), Zn{gt)) q^^^^- Note 
that {Znigt))o<t<i is the usual tail empirical process, whose asymptotic behavior has been inves- 
tigated by Rootzen (1995, 2009) and Drees (2000). 

(CI) The /3- mixing coefficients 

(3n,k:= sup e( sup \P{B\Bl^^) - P{B)\) 

l<l<n-k-l ^BeB-,^^^, ' 

of the vector of excesses (X^ — F^(l — z;„(l — e)))^<;j<„ satisfy lin,in''^/fn — ^ for some 
sequence /„ = o{rn)- Here B^^^ denotes the u-field generated by (X^ — F^(l — z;„(l — 
^)))t<k<j for some e > 0. 

(C2) r„ -> oo, rnVn -^ 0, nvn -^ oo. 

(C3.1) For some e > 

-—Cov^^l^X^ > F-(l - t;„(l - «))}> E 1{X, > F-(l - t;„(l - t))} 
1=1 1=1 

-^ Cg{s,t) V -e < s,t < 1. 
(C3.2) For some e > 

;rr-Co^(l{maxi<,<,„ X, >F^{1- i;„(l - s))}' E ^jX, >F^{1- t>„(l - t))} 

—7- Cfg{s, t) V — £ < S, t < 1. 



(C4) There exists a bounded function /i : (0, 1] — t- M such that limt_!.o h{t) = and for sufficiently 
large n 

;r^^(EV-(l-.„(l-.))<X,<F-(l-.„(l-t))}) <^(^-^) V-e<.<t<l. 
" "■ i=i 

Theorem 2.1. (^ij Under the conditions (CI) and (C2), (^n(/i))o<t<i converge weakly to Zf : = 
{v9Bt)o<t<i with B denoting a standard Brownian motion. 

(a) If the conditions (CI), (C2), (C3.1) and (C4) are met andrn = o{yJnVn), then {Zn{gt))o<t<i 
converge to a centered Caussian process iZ{gt))o<t<i with covariance function Cg. 

(Hi) If the conditions (C1)-(C4) are satisfied andrn = o{y/nvn), then {Zn{ft),Zn{gt))o<t<i con- 
verge weakly to {Zf{t), Zg{t))o<t<i with 

Cov{Zf{s),Zf{t)) = e{sAt), 

C0V{Zg{s),Zg{t)) = Cg{s,t), 

Cov{Zf{s),Zg{t)) = Cfg{s,t), 0<s,t<l. 
Remark 2.2. The covariance conditions (C3.1) and (C3.2) are fulfilled if all finite dimensional 



marginal distributions (Xi, . . . ,Xk) belong to the domain of attraction of some multivariate extreme 
value distribution, lim„_>oo lini sup^_j.(^ Pn,m = 0, and the following condition holds: 

(C5) For Sonne 5 > 

/ JA \ 2+<5 

i=l 

In this case, Segers (2003) has shown that the conditional distributions py^",'n<i<k\Uri,i7^0 of {Un.i)i<i<k 
given that the first observation exceeds the threshold converge weakly to the distribution o/(Wj)i<j<fc = 
{Vi VO)i<j<fc, where {Vi)i<i<k is the so-called tail sequence pertaining to the time series Ui, i G N. 
The limiting covariance functions Cg and Cfg are then given by 

oo 

Cg{s,t) = s^t + ^{P{Wl>l-s,Wk>l-t] + P{Wl>l-t,Wk>l-s}), (2.2) 

fc=2 

P{Wi > 1 - t, maxj>i Wj >1- s} 
c/9(s,i) = { +i2T=2P{Wi>l-s,Wk>l-t,ma^j>2Wj<l-s}, s < t, (2.3) 
t s>t. 

Using the joint convergence of Zn{ft) and Zn{gt) and the representation (j2.ip . one can easily derive 
a limit theorem for the processes (^j^i)o<i<i of blocks estimators. 

Corollary 2.3. Under the conditions of Theorem \2.1\ (Hi) 



{Vnv^t{6^^ - 0n,t)) Q^-t^-^ -^ Z ■■= Zf - 9Zg weakly as n^oo 
E{ft{Yn,i)) _ P{maxi<i<,„ X, > F^{l-Vnt)] 



'0<t<l 

with 



7n,t 



E{gt{Yn,l)) rnVnt 

The limit process Z is Gaussian with E{Z{t)) = and 

Cov{Z{s),Z{t)) = 9{sAt- Cfg{s, t) - Cfg{t, s)) + e^Cg{s, t) =: c{s, t). (2.4) 



Note that the centering constant 6n,t, which is the leading term in the representation (12. ip . con- 
verges to 9 uniformly for all t G (0, 1] by Hsing's (1993) result (|1.4p . However, the convergence can 
be rather slow leading to a large bias of the blocks estimator as observed in Figure [TJ 
In the next section we will see how to combine all blocks estimators 0* j non-linearly such that the 
resulting estimator has a much smaller bias. As the threshold F'^{1 — Vnt) is unknown, for any 
given threshold Un in the definition (jl.5p it is not known for which index t one has 9n = 9^f Hence, 
we first need an analog to Corollarv 12.31 for the estimator 9n^t with random threshold Xn-TnvntV.n- 
To this end, we analyze the difference between the deterministic threshold 1 — f„t (after stan- 
dardization of the marginals) and its random counterpart 1 — Un-\nv„t]:n- It has been shown in 
Drees (2000), proof of Corollary 3.1, that ^/nv^[{l — Un-lnv„t]■.n)/vn — t)Q^f-^l converges to a Gaus- 
sian process if {Zn{gt))~s<t<i converges to a Gaussian process. Note that the latter convergence 
follows from an analog to Theorem 12.11 (ii), because the conditions (C3.1) and (C4) have been 
formulated for s,t G [— e, 1] (while for Theorem 12. II (ii) to hold it suffices to require the conditions 
for s,t G [0,1]). This suffices to establish a limit theorem for 9n^f It turns out that under a 
suitable continuity condition on 9n,t, the blocks estimator with estimated threshold has the same 
asymptotic behavior as 0* j. 



Corollary 2.4. Suppose the conditions of Theorem \2.1\ (Hi) are met. Then 

{^/fm;^t{9n,t - ^n,ii-U^-i„v„t]:n)/vn))o<t<i ^ ^ ^60% asn^oo. (2.5) 

If, in addition, to each to G (0, 1) and each Mi > there exists M2 > such that 

< M2{nvnr^'\ (2.6) 



sup 

s,t>to,|s-i|<Mi(ni)„)-i/2 



7n± 



then 

{■s/nvnt{9n,t - ^n,i)) o<i<i "^ ^ wcakly as n^ 00. (2.7) 

3 Bias correction 

As in Figure [TJ the blocks estimator On^t often exhibits a clear trend, that is caused by its bias, 
when it is plotted versus t. In this section we show how to combine blocks estimators for different 
thresholds such that the leading bias term vanishes while the order of magnitude of the random 
error is preserved. To this end, we make structural assumptions on the form of the bias 0^,* — ^ as 
a function of t. The following examples demonstrate that in time series models discussed in the 
literature the leading bias term often equals a power of t with positive exponent. 

Example 3.1. Let Zi, i G N, be iid r.v.s with d.f. F, and let ^j, z G N, denote a series of iid 
Bernoulli rvs, independent of (Zj)jgN, with P{^i = 0} = V = 1 ~ P{^i = !}• Weissman and 
Novak (1998, p. 285) proved that then the time series Xq := Zq, Xt := ^tZt + (1 - it)Xt-i, t e N, 
is stationary with marginal d.f. F and extremal index 6 = 1 — ^. Moreover, if F is eventually 
continuous, then for all to > 



7n,t 



I - ii - vnt){i - evnty"^-^ ^ e^ i-e ^^ , 2n 



'^n'^n^ ^ 



uniformly for t G [tojl]- If ^n'^n -^ 00, then the linear function — 0^r„t;„t/2 is the leading bias 
term. 

Example 3.2. Consider a finite order moving maxima time series 

Xt = max {ipjZt-j) 

with non-negative coefficients ipj > 0. W.l.o.g. we may and will assume that maxo<j<g f/'j = 1. 
Further assume that the innovations Zt are iid with heavy tailed d.f. Fz satisfying 

Fz{z) := 1 - Fz{z) = ciz-^' (1 + C2Z-^' + o{z~^'')) 

for some /3i, /32, ci > and 02 / 0. 
If 132 < Pi, then 

F{x) := P{Xt < x} 

= P{Zt-j<x/i^jyO<j<q} 



Yl(l- Clix/ll^j)-^^ (1 + C2(x/Vj)-^^ + 0(x-^2) 






1 _ ci ^^f X-'^^ - C1C2 ^^f +^^X-('^^+'^^) + 0(x-(''^+^^)) 
i=0 i=0 



as X — )■ oo, and thus for all fixed A > 









To determine 9nt, check that with 



d:=-^ ' . ..Ax ^-°^^- 



it follows that 

P| max Xt < F^(l - w„t)| 
*■ l<t<r„ '^ 



■{• 



F^{l-Vnt) 
maXov(l_m.)<j<(jA(r„-m) V'j 



Ps Zm < — y 1 — a < m < rr, 



n-4^=^^^)-nV4^=^^)- ft ..f^^<'-' 



„=i_g maxi_„,<j<gVj^ J^j-^ \maxo<j<giJjJ ^^^^tg^^ Vmaxo<j<r„-m V'j 

= n (1 + OM) • (l - / ^^ Vnt - d{Vr^)'+MP. + ^(^l+/3./ft)y"-'^ 

m=l~q ^ ^j=0 ^j 

• n (1+oK)) 

m=r„-ij+l 

Hence, if rnVn ^ — ^ oo but r„f„ ^ — )■ (which implies 1^2 < /?i/2), then for all to > 
1 - P{ maxi<j<^„ Xj < F^(l - t;„t)} _ 1 






+ dMf2//3i+o(^^2//3i) 



uniformly for t £ [^0)1]- Here the the constant d is strictly negative if ipj £ (0,1) for some 
j G {0, . . . ,q}. Hence, in this case, 6 := X/Yl'j=oi^j is the extremal index and the leading term 
of the bias 6n,t — is a multiple of fP2/pi _ 

Now we investigate the general case, i.e. we do not assume that /32 < /^i- By similar calculations 
as above, we obtain that 

q 
F{x) = l + ciY^ ^f x-^i + 0(x^(^i+^2) + x-2/31) 
i=o 

Ah 

=^ Fz{x/A) = ^F{x) + 0(x-(^i+^2) + x"2,9i). 



Therefore 

P{ max Xt < F^(l - v„t)} 






^ 1 / „ ,. -t \ 2 



which in turn implies 



,,, ' \{^S^) +0{v^ + ir.v.f + rM-''^/^^+vl)), 



+ 



n.oV'f nn^ov^ 



32 



6'n,t = 6* - —rnVnt + o{rnVn) 

if rnf™*^^ 2/ ij _^ ^^^ Hence, in this case the leading bias term is a linear function of t. 

Remark 3.3. Theorem 4-1 of Hsing (1993) suggests that indeed for m- dependent time series with 
m-dimensional regularly varying marginal distributions the leading bias term usually is a linear 
function of t i/ r„ — )■ oo sufficiently fast. 

We propose the following estimator of the extremal index with reduced bias: 

On,sOn,tKds,dt) 

/ On,s + On,t Ai(c^S, dt) 

J{0,1]2 

where fi is some finite signed measure on (0, 1]^ satisfying the following conditions: 

(Ml) The signed measure fx^ induced by the product map vr : (0,1]^ — )■ (0,1], vr(s,t) = st, 
vanishes, i.e. /iJTr e B} = for all B G B((0, 1]). 

(M2) f s^ + t^ fi{ds, dt) / for all 5 > 0. 

(M3) The total variation measure |/i| pertaining to fi satisfies JJq ^i2{st)^^ \fi\(ds,dt) < oo. 

Example 3.4. (i) Let F, G be d.f.s of probability measures Qp and Qg on (0, 1] such that 

/(o,i] *"^ Qpidt) < oo, /(o -^j t~^ Qcidt) < oo and /^q ^ t^ Qridt) / /^q ^ t^ Qcidt) for aU 5 > 

rp 

0. (The latter condition is, for instance, fulfilled if Qp equals the distribution Qq of the map 
Th{x) := x/b under Qg for some 6 > 1.) Then the signed measure n = Qp Qg — Qf ^ Qg 
for some a > 1 (i.e., /x((0,x] x (0, y]) = F{ax)G{y) — F{x)G{ay) for all x,y G (0, 1]) satisfies 



10 



the conditions (M1)-(M3): 

fi{{s,t)\st<u} = / G{u/s)Q^p^{ds)- G{au/s)QF{ds) = 0, 

7(0,1] 7(0,1] 

/ s^ + t^ n{ds,dt) = I s^Q]?{ds)+ f t^ Qcidt) - [ s^ Qpids) - [ t^Q^(f{dt) 
J(o,i]2 7(0,1] 7(0,1] 7(0,1] 7(0,1] 



J (0,1] 


i(0,l] 


i(0,l] 


= (a-^- 


-1)(/ s'Qpids)- 
W(o,i] 


- / t'Qcidt) 
7(0,1] 


/ 0, 







/ {st)-^\^J.\{ds,dt) = [ s'^Ql^{ds) f r^QG{dt)+ f s-^Qpids) [ r^Q^^idt) 
7(o,i]2 7(0,1] 7(0,1] 7(0,1] 7(0,1] 

= (1 + a)/ s-^Qpids)- [ t-^QGidt)<oo. 
7(0,1] 7(0,1] 



'(0,1] ^(0,: 

(ii) The above example is a special case of the following more general construction. Let T : 
(0,1]2 ^ D := {{u,v) I < n < t; < 1}, T{x,y) := {xy,y), and let T''^ : D -^ 
(0,1]^, T^^{u,v) = {u/v,v) denote its inverse. Choose some measure v on (0,1] satisfy- 
ing J,Q -j^, s^^ v{ds) < oo, and Ivlarkov kernels Ki and K2 from (0, 1] to (0, 1] such that 

Ki{u, [u, 1]) = 1. Then the signed measure fx := {v Ki) — {v (^ K2) meets the con- 
ditions (Ml) and (M3), because vr = T o pn with pri denoting the projection on the first 

coordinate, and thus ^'^ = ( ((z^ ^ Ki)'^ ) j — ( ((z/ (gi K2) ) J = i^ — u = and 

J(^Q^^2isty^ {ly <^ Ki)"^ {ds,dt) = j^^-^-^2U^^ {v ® Ki){du,dv) < 00. 

Our main result shows that the bias of 9n,^ (and hence its estimation error) is of smaller order than 
the bias of On^t if the bias dominates the random error and its leading term is a power function. 

Theorem 3.5. Suppose that conclusion (|2.7p of Corollary \2.4\ holds and that 



en,t = On + Cnt^ + Rnit) VtG(0,l] (3.2) 

for some 6 > with dn := supo<t<i t|-Rn(i)| = o{cn) and (nt;„)~^'^ = o{cn)- If the conditions 
(M1)-(M3) are fulfilled, then 

I sh'^Z{t) + t^s-^Z{s)n{ds,dt) 

l/2Vl]f 

/ s^ + t^ fi{ds,dt) 
7(o,i]2 

s^Rnit) + t^R„{s) ti{ds, dt) 

/ s^ + t^ fi{ds,dt) 
7(o,i]2 

In particular, if dn = o{{nVn)~^''^), then 

I sh-^Z{t) + t^s-^Z{s)fi{ds,dt) 
7(0,112 

/ s^ + t^ n{ds,dt) 
7(0,112 



'(0,1]2 



+ ^^^^ + OpiinVn)-'^^ + dn). 



V^m^{0n,^-0n) -^ ^-^^ f . (3.3) 
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Remark 3.6. //supo<i<i |i?n(t)| = o[{nVn) ^''^), then assertion (|3.3p holds if merely convergence 
(j2.5p is required instead of (j2.7p . that is, the smoothness assumption (j2.6p on 9n,t is not needed. 



In ()3.3p the leading bias term which depends on the threshold is removed, while the random error is 
still of the order (nw„)^^'^. To analyze the latter, w.l.o.g. we may assume that the signed measure 
H is symmetric, because 9n,^ = On,fi for fl{ds,dt) := ij,{ds,dt) + iJ,{dt,ds) and fl satisfies (M1)-(M3) 
iff n meets these conditions. Then the right-hand side of (j3.3p equals 











I 


' sh- 

(0,1]2 


'^Z{t)^i{ds,dt) 
















/ 


s fi{ds,dt) 






which is 


a centered G 


aussian rv i 


with 


variance 












•rl: 


J{0,1 


]2 J{0., 


{ss)^{ti)~'^c{t,i)n{ds, 


dt) fi{ds. 


,di) 








(/ 


. \ 2 

s fi{ds, dt) 







W(0,l]2 ' 

If /x is the symmetrized version of the signed measure discussed in Example 13.41 (i) with / and g 

rp 

denoting Lebesgue densities of Qp and Qc '■= Qp , respectively, then 

2 _ ( ab \ 

^/^ - l^(i_a-5)(i_5-<5) I ^ 

1 /-i 



JO 



X / / (a-(^+i)/(6t) + 6-(^+i)/(ai) - f{abt) - (a6)-(^+i)/(f)) x 



X (a-(^+i)/(fei) + 6-(^+i)/(at) - f{abi) - {aby^^+^^ f{i)){ti)-^c{t,i) dtdt. 



To estimate this asymptotic variance is essentially as difficult as to determine the asymptotic 
variance of the original blocks estimators. To this end, one may employ ideas developed in Drees 
(2003), but a bootstrap approach, that will be worked out in a forthcoming paper, seems more 
promising. 

Finally, we would like to mention that our approach is obviously not capable of removing the part 
On — oi the bias which does not depend on the threshold but on the block length r„. 

4 Proofs 

Proof of Theorem 12. 1[ We apply Theorem 2.10 of Drees and Rootzen (2010) to prove asymp- 
totic equicontinuity of the processes and Theorem 2.3 to establish convergence of the finite di- 
mensional marginal distributions. To this end, we must verify the conditions required in these 
theorems. 

(i) The assumptions (Bl) and (B2) of Drees and Rootzen (2010) follow from our conditions (CI) 
and (C2). For the functionals ft, condition (C2) of Drees and Rootzen (2010) is trivial. Condition 
(C3) of Drees and Rootzen (2010) reads as 

P{maxi<^<,„ Un,i > (1 - g) V (1 - t)} ^ ^ 
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(cf. Drees and Rootzen (2010), (4.1)). This is immediate from (|1.4p . which imphes 

P{maxi<j<r„ Un,i > l-t} 
— > u 

uniformly for t G (0, 1]. 

Likewise, condition (D3) of Drees and Rootzen (2010) is equivalent to 

,. ,. P{1 -t < maxi<i<.r„C/„,i < 1 -s} 
limhmsup sup ^^ = 0, 

i5iO n^oo 0<s<t<l,t-s<5 fnVn 



(4.1) 



which again is a direct consequence of the uniform convergence (14. Ih . 

The remaining conditions can be verified by the arguments given in Drees and Rootzen (2010), 
Section 4 and the proof of Corollary 4.3. (Note that Z„(/t) equals the random variable 2'„(1 — t) 
defined in Example 4.2 (with A; = 1) of that paper.) 

(ii) This assertion is a reformulation of the results on the univariate tail empirical process given in 
Example 3.8 of Drees and Rootzen (2010). 

(iii) The equicontinuity of the joint process immediately follows from the equicontinuity of {Zn{ft))o<t<i 
and {Zn{gt))o<t<i and a similar remark applies to the conditions (CI) and (C2) of Drees and 
Rootzen (2010). The remaining condition (C3) follows from (C3.1) and (C3.2) of the present pa- 
per and the calculations in part (i) above. □ 



Proof of Remark 2.2. The conditions (C3.1) and (C3.2) follow by similar arguments as in 
Remark 3.7 (ii) of Drees and Rootzen (2010) (cf. also Corollary 2.4 of that paper). Here we have 



c,is,t) = i?(l(i_,,i](TVi)l(i_i,i](W^i) 

oo 



k=2 

which equals the right hand side of (j2.2p . and 



Cfg{s,t) = ii;(l(i _ ..^ i](maxH^i) ^ l^^ _ ^^ ^^{Wk) - 1^^ _ ^^ i](maxt^,) ^ l^^ _ ^^ ^^{Wk 

k=l ~ k=2 

If s > t and the first sum does not vanish, then the first indicator equals 1. Together with a similar 
reasoning for the second sum, one obtains 

Cfg{s,t) = E{l^-^_^^^^^{Wl))=t. 
In the case s < t, direct calculations show that 

Cfgis,t) = i?(l(i_^^i](maxt^,)l(l_t,l](^i) 

oo 

+ (1(1 - s, 1](™T^.) - 1(1 _ ,^ l](maxt^,)) E 1(1 - t, l](^k: 

k=2 
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is equal to the right hand side of (|2.3p . □ 

Proof of Corollary 12. 3[ Using E{gt{Yn,i)) = rnVnt and representation ()2.ip . we obtain by 
simple calculations 

VnVnt{On,t - ^n,t) = — ., . , . ■ 4.2 

"i„r„ 1 + ^/rmn/{mnrnVnt)Zn{gt) 

By the equicontinuity of {Zn{gt))o<t<i and Zn{go) = 0, there exists a sequence r/^ — ?■ such that 
supo<t<(n«„)-i/'* \^n{gt)\ = Op{rin). Heucc, for tn := (r?„/(n^;n))^/^ 

\Zn{gt)\ ^ ( Vn \ , \Zn{gt)\ .-,^ 

sup = Op + sup — - = Op(l), 

SO that the denominator of the second fraction tends to 1 uniformly for t G [t„, 1]. Moreover, since 
both 0* J and 0„^i are bounded, 

sup ^/nv^t\e^^-en,t\=op{l). (4.3) 

o««„ 



Finally, the continuity of Z implies 



sup \Z{t)\ = op{l). (4.4) 

0<t<i„ 



Therefore, in view of (J4.2p - (j4.4p . Theorem 12.11 and the uniform convergence of 9n^t to 9 prove the 
assertion. D 

Proof of Corollary 12.41 Check that under the conditions of Theorem 12.11 (iii) the follow- 
ing equivalences hold on a set with probability tending to 1: Xi > Xn-inv„t]:n *^=^ Ui > 

Un-\nvnf]:n ^^ Un,i > 1 - (1 - t/„-[™„t]:„)/Wn, and thuS 9n,t = ^n,s„it) ^^^^ ■5"(*) •= (^ ~ 

Un-\nvni\:n)/vn- An application of Vervaat's (1972) Theorem 1 to the assertion of Theorem 12.11 
(ii) yields 

y/nV^{Sn{t) - t)Q<t<l — > Zg (4.5) 

(cf. the proof of Corollary 3.1 of Drees (2000)). In particular, Sn{t)/t — )■ 1 uniformly for all 
iG[(n?;„)-V3,i]. 

Moreover, by continuity suPq<;j<(„„^n-i/3 \Z{t)\ — )• and thus by Corollarv 12.31 

VnV^t{9n,t - ^n,s„(t))l[(ni,„)-i/^l](*) = \/^^^n(0 (^n,s„(i) " ^n,s„(i)) " ^~7^1[(™„)-i/3,i] (*) 

^ Z{t) (4.6) 

uniformly for t G [0, 1] . 
Next note that 

sup y/rmnt\9n,t-9n,s„{t)\<{nvn)~'^^'^ — > 0, (4.7) 

0<i<{ni)„)-3/4 
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while for (nvn) ^^^ <t < (nvn) ^^^ 



fllj j] / 777, \ 

Zn{fs^{t)) + —^P{ max Xi > F^(l - Vns)}\s=s„{t)) - y/nv^tSn,s„{t) 



\nvnt\ 

< \Zn{fs„it))\ + 



.JruTr, 

nVnt mnVn 



\nvnt\ n 



Vnu;^Sn{t)en,s„{t) + y/n^\sn{t) - t\0n,s„{t)- (4- 



The first term on the right-hand side tends to uniformly by Theorem 12.11 (i) and the continuity 
of Zf, the last term converges to by (j4.5p and the continuity of Zg. Furthermore, by (j4.5p 



sup 

(n«„)-3/4<t<(nt;„)-l/3 



nVnt ninrn 



\nvnt\ n 



,/T^Sn{t) = Op((nt;„)-i/^ + rjn) ■ Op((™„)i/6) ^ 0. 



Combining this with (I4.6p - (l4.8p . we arrive at the first assertion. 

It remains to prove that under the additional continuity condition on On^t 



y/rm^ sup t\en,s„(t) - On,t\ 
0<t<l 







7n,s "n,t\ 



P{ max Xi > F^(l-Vns)] 



in probability. To this end, first check that 

,„ „ , 1 1 

" ' < 

+^— P|F^(1 - Vn{s V t)) < max Xi < F^{1 - Vn{s A t))| 

TnVnt l<i<r„ 

\t-s\ 1 , , 

< rnVnS-\ :-rnVn\t-s\ 



< 2 



't-s\ 



fn'^'nJ' 



t 



Hence, again by (|4.5p and the continuity of Zg for each 6 > there exists ?? > such that 

P{^/nV^ sup t\9n^sU) - 9n,t\ > S} < 6. 
0<t<ri 



On the other hand, by (j4.5p . assumption (j2.6p and Hsing's result (|1.3I 



V'riv^t\(^n,Snit) - ^n,t| = y/nVnt 



^n,s„{t) 



7n,t 



,,t-0\ = Op{\en,t-9\)^o 



uniformly for t S [rj, 1], which completes the proof. 
Proof of Theorem 13.51 By condition (Ml) 



D 



/ 



M(o,i]') = o. 



(4.9) 
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Thus 

i){On,t -9n)ll{ds,dt) 



_-"n,s ^nj 
-'n,fj, ^n — 



'n,s - On) + iOn,t " On) nids, dt) 

In view of (j2.7p and the integrabUity condition (M3) , the right-hand side has the same distribution 



as 



/ {{nVn)-^l'^s-^{Z{s) + op(l)) + c„/ + i?„(s)) ((nt;„)-i/2^-i(Z(t) + op(l)) + c^f^ + i?n(i)) \i{ds, dt) 



Cn n' s^ + t^ f^ids, dt) + op(l)) + (™„)-i/2 f^ f s-iz{s) + t-iZ(t) fi{ds, dt) + Op(l) 
Because of (|4.9p . the conditions (M2) (M3), (nw„)^^/^ = o(c„) and (i„ = o(c„) this fraction equals 
j{nVn)-^^^{t^s-^Z{s) + /t-iZ(t)) + s^Rn{t) + t^Rnis) ^i{ds, dt) + op{{nVn)-^/^) 

s^ + t^ fi{ds,dt) + op{l) 



Now the first assertion is obvious and convergence (j3.3p is an immediate consequence of the addi- 
tional assumption (i„ = o((nu„)^^") and the integrability condition (M3). □ 



Proof of Remark 13.61 Recall the definition of s„(t) from the proof of Corollary 12.41 

For < 6 < 1 and < u < v one has v — u < v (v — u) < u {v — u) and hence 
\{sn{t)f - t^)\ < t^-'^\snit) - t\. For 5 > 1, the mean value theorem implies |(s„(t))'' - t^)\ < 
S\sn{t) — t\. Combining both inequalities with convergence (j4.5p . we conclude \{sn{t))^ — t^)\ = 
0p[{nvn)^^''^t^^) . Moreover, under the given conditions, Rn{sn{t)) = op[{nVn)~^''^)- Hence, 
On,t — On = {nvn)^^''^t^^ {Zn{t) + op(l)) -|- Cnt^ and we proceed as in the proof of Theorem 13.51 to 
estabhsh (ESI). D 
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